edge intelligence
Embodied Edge Intelligence Meets Near Field Communication: Concept, Design, and Verification
Li, Guoliang, Jin, Xibin, Wan, Yujie, Liu, Chenxuan, Zhang, Tong, Wang, Shuai, Xu, Chengzhong
Realizing embodied artificial intelligence is challenging due to the huge computation demands of large models (LMs). To support LMs while ensuring real-time inference, embodied edge intelligence (EEI) is a promising paradigm, which leverages an LM edge to provide computing powers in close proximity to embodied robots. Due to embodied data exchange, EEI requires higher spectral efficiency, enhanced communication security, and reduced inter-user interference. To meet these requirements, near-field communication (NFC), which leverages extremely large antenna arrays as its hardware foundation, is an ideal solution. Therefore, this paper advocates the integration of EEI and NFC, resulting in a near-field EEI (NEEI) paradigm. However, NEEI also introduces new challenges that cannot be adequately addressed by isolated EEI or NFC designs, creating research opportunities for joint optimization of both functionalities. To this end, we propose radio-friendly embodied planning for EEI-assisted NFC scenarios and view-guided beam-focusing for NFC-assisted EEI scenarios. We also elaborate how to realize resource-efficient NEEI through opportunistic collaborative navigation. Experimental results are provided to confirm the superiority of the proposed techniques compared with various benchmarks.
- Asia > China > Guangdong Province > Shenzhen (0.05)
- Asia > Macao (0.05)
- Asia > China > Hong Kong (0.04)
- (7 more...)
On-Device Training of PV Power Forecasting Models in a Smart Meter for Grid Edge Intelligence
Huang, Jian, Zhu, Yongli, Xu, Linna, Zheng, Zhe, Cui, Wenpeng, Sun, Mingyang
In this paper, an edge-side model training study is conducted on a resource-limited smart meter. The motivation of grid-edge intelligence and the concept of on-device training are introduced. Then, the technical preparation steps for on-device training are described. A case study on the task of photovoltaic power forecasting is presented, where two representative machine learning models are investigated: a gradient boosting tree model and a recurrent neural network model. To adapt to the resource-limited situation in the smart meter, "mixed"- and "reduced"-precision training schemes are also devised. Experiment results demonstrate the feasibility of economically achieving grid-edge intelligence via the existing advanced metering infrastructures.
Edge Intelligence Optimization for Large Language Model Inference with Batching and Quantization
Zhang, Xinyuan, Liu, Jiang, Xiong, Zehui, Huang, Yudong, Xie, Gaochang, Zhang, Ran
Generative Artificial Intelligence (GAI) is taking the world by storm with its unparalleled content creation ability. Large Language Models (LLMs) are at the forefront of this movement. However, the significant resource demands of LLMs often require cloud hosting, which raises issues regarding privacy, latency, and usage limitations. Although edge intelligence has long been utilized to solve these challenges by enabling real-time AI computation on ubiquitous edge resources close to data sources, most research has focused on traditional AI models and has left a gap in addressing the unique characteristics of LLM inference, such as considerable model size, auto-regressive processes, and self-attention mechanisms. In this paper, we present an edge intelligence optimization problem tailored for LLM inference. Specifically, with the deployment of the batching technique and model quantization on resource-limited edge devices, we formulate an inference model for transformer decoder-based LLMs. Furthermore, our approach aims to maximize the inference throughput via batch scheduling and joint allocation of communication and computation resources, while also considering edge resource constraints and varying user requirements of latency and accuracy. To address this NP-hard problem, we develop an optimal Depth-First Tree-Searching algorithm with online tree-Pruning (DFTSP) that operates within a feasible time complexity. Simulation results indicate that DFTSP surpasses other batching benchmarks in throughput across diverse user settings and quantization techniques, and it reduces time complexity by over 45% compared to the brute-force searching method.
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > California > San Diego County > Carlsbad (0.04)
- North America > United States > California > Los Angeles County > Long Beach (0.04)
- (2 more...)
Resource-efficient Generative Mobile Edge Networks in 6G Era: Fundamentals, Framework and Case Study
Lai, Bingkun, Wen, Jinbo, Kang, Jiawen, Du, Hongyang, Nie, Jiangtian, Yi, Changyan, Kim, Dong In, Xie, Shengli
As the next-generation wireless communication system, Sixth-Generation (6G) technologies are emerging, enabling various mobile edge networks that can revolutionize wireless communication and connectivity. By integrating Generative Artificial Intelligence (GAI) with mobile edge networks, generative mobile edge networks possess immense potential to enhance the intelligence and efficiency of wireless communication networks. In this article, we propose the concept of generative mobile edge networks and overview widely adopted GAI technologies and their applications in mobile edge networks. We then discuss the potential challenges faced by generative mobile edge networks in resource-constrained scenarios. To address these challenges, we develop a universal resource-efficient generative incentive mechanism framework, in which we design resource-efficient methods for network overhead reduction, formulate appropriate incentive mechanisms for the resource allocation problem, and utilize Generative Diffusion Models (GDMs) to find the optimal incentive mechanism solutions. Furthermore, we conduct a case study on resource-constrained mobile edge networks, employing model partition for efficient AI task offloading and proposing a GDM-based Stackelberg model to motivate edge devices to contribute computing resources for mobile edge intelligence. Finally, we propose several open directions that could contribute to the future popularity of generative mobile edge networks.
- Asia > China > Guangdong Province > Guangzhou (0.04)
- Asia > South Korea > Gyeonggi-do > Suwon (0.04)
- Asia > Singapore (0.04)
- (3 more...)
- Overview (0.68)
- Research Report (0.64)
- Information Technology > Networks (0.47)
- Information Technology > Security & Privacy (0.46)
MP-FedCL: Multiprototype Federated Contrastive Learning for Edge Intelligence
Qiao, Yu, Munir, Md. Shirajum, Adhikary, Apurba, Le, Huy Q., Raha, Avi Deb, Zhang, Chaoning, Hong, Choong Seon
Federated learning-assisted edge intelligence enables privacy protection in modern intelligent services. However, not independent and identically distributed (non-IID) distribution among edge clients can impair the local model performance. The existing single prototype-based strategy represents a class by using the mean of the feature space. However, feature spaces are usually not clustered, and a single prototype may not represent a class well. Motivated by this, this paper proposes a multi-prototype federated contrastive learning approach (MP-FedCL) which demonstrates the effectiveness of using a multi-prototype strategy over a single-prototype under non-IID settings, including both label and feature skewness. Specifically, a multi-prototype computation strategy based on \textit{k-means} is first proposed to capture different embedding representations for each class space, using multiple prototypes ($k$ centroids) to represent a class in the embedding space. In each global round, the computed multiple prototypes and their respective model parameters are sent to the edge server for aggregation into a global prototype pool, which is then sent back to all clients to guide their local training. Finally, local training for each client minimizes their own supervised learning tasks and learns from shared prototypes in the global prototype pool through supervised contrastive learning, which encourages them to learn knowledge related to their own class from others and reduces the absorption of unrelated knowledge in each global iteration. Experimental results on MNIST, Digit-5, Office-10, and DomainNet show that our method outperforms multiple baselines, with an average test accuracy improvement of about 4.6\% and 10.4\% under feature and label non-IID distributions, respectively.
Codesign of Edge Intelligence and Automated Guided Vehicle Control
Gallage, Malith, Scaciota, Rafaela, Samarakoon, Sumudu, Bennis, Mehdi
Abstract--This work presents a harmonic design of autonomous guided vehicle (AGV) control, edge intelligence, and human input to enable autonomous transportation in industrial environments. The AGV has the capability to navigate between a source and destinations and pick/place objects. The human input implicitly provides preferences of the destination and exact drop point, which are derived from an artificial intelligence (AI) module at the network edge and shared with the AGV over a wireless network. The demonstration indicates that the proposed integrated design of hardware, software, and AI design achieve a technology readiness level (TRL) of range 4-5. The rapid growth of customer demands and increasing costs of resources, labor, and energy have driven industries to seek new technologies that improve productivity and efficiency.
On Achieving Privacy-Preserving State-of-the-Art Edge Intelligence
Chabal, Daphnee, Sapra, Dolly, Mann, Zoltán Ádám
Deep Neural Network (DNN) Inference in Edge Computing, often called Edge Intelligence, requires solutions to insure that sensitive data confidentiality and intellectual property are not revealed in the process. Privacy-preserving Edge Intelligence is only emerging, despite the growing prevalence of Edge Computing as a context of Machine-Learning-as-a-Service. Solutions are yet to be applied, and possibly adapted, to state-of-the-art DNNs. This position paper provides an original assessment of the compatibility of existing techniques for privacy-preserving DNN Inference with the characteristics of an Edge Computing setup, highlighting the appropriateness of secret sharing in this context. We then address the future role of model compression methods in the research towards secret sharing on DNNs with state-of-the-art performance.
- Information Technology > Security & Privacy (1.00)
- Information Technology > Cloud Computing (1.00)
- Information Technology > Data Science > Data Mining > Big Data (0.94)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
MEET: Mobility-Enhanced Edge inTelligence for Smart and Green 6G Networks
Sun, Yuxuan, Xie, Bowen, Zhou, Sheng, Niu, Zhisheng
Edge intelligence is an emerging paradigm for real-time training and inference at the wireless edge, thus enabling mission-critical applications. Accordingly, base stations (BSs) and edge servers (ESs) need to be densely deployed, leading to huge deployment and operation costs, in particular the energy costs. In this article, we propose a new framework called Mobility-Enhanced Edge inTelligence (MEET), which exploits the sensing, communication, computing, and self-powering capabilities of intelligent connected vehicles for the smart and green 6G networks. Specifically, the operators can incorporate infrastructural vehicles as movable BSs or ESs, and schedule them in a more flexible way to align with the communication and computation traffic fluctuations. Meanwhile, the remaining compute resources of opportunistic vehicles are exploited for edge training and inference, where mobility can further enhance edge intelligence by bringing more compute resources, communication opportunities, and diverse data. In this way, the deployment and operation costs are spread over the vastly available vehicles, so that the edge intelligence is realized cost-effectively and sustainably. Furthermore, these vehicles can be either powered by renewable energy to reduce carbon emissions, or charged more flexibly during off-peak hours to cut electricity bills. Yuxuan Sun is with School of Electronic and Information Engineering, Beijing Jiaotong University, Beijing 100044, China, and was previously with Tsinghua University. Bowen Xie, Sheng Zhou (Corresponding Author) and Zhisheng Niu are with Beijing National Research Center for Information Science and Technology, Department of Electronic Engineering, Tsinghua University, Beijing 100084, China.
- Transportation > Ground > Road (0.47)
- Energy > Renewable (0.35)
- Energy > Power Industry (0.34)
Top AI in Business Trends to Look Out For in 2022
Artificial intelligence is thriving in all kinds of businesses in the last few years. There have been multiple speculations about business trends in 2022 with the integration of AI in business. Trends of AI in business are essential to know for business owners to drive higher revenues with better customer engagement while gaining a competitive edge. Thus, let's get to know about some of the top AI in business trends to look out for in 2022. The integration of artificial intelligence in real-time video processing is one of the top AI in business trends in 2022.
Roadmap for Edge AI: A Dagstuhl Perspective
Ding, Aaron Yi, Peltonen, Ella, Meuser, Tobias, Aral, Atakan, Becker, Christian, Dustdar, Schahram, Hiessl, Thomas, Kranzlmuller, Dieter, Liyanage, Madhusanka, Magshudi, Setareh, Mohan, Nitinder, Ott, Joerg, Rellermeyer, Jan S., Schulte, Stefan, Schulzrinne, Henning, Solmaz, Gurkan, Tarkoma, Sasu, Varghese, Blesson, Wolf, Lars
Based on the collective input of Dagstuhl Seminar (21342), this paper presents a comprehensive discussion on AI methods and capabilities in the context of edge computing, referred as Edge AI. In a nutshell, we envision Edge AI to provide adaptation for data-driven applications, enhance network and radio access, and allow the creation, optimization, and deployment of distributed AI/ML pipelines with given quality of experience, trust, security and privacy targets. The Edge AI community investigates novel ML methods for the edge computing environment, spanning multiple sub-fields of computer science, engineering and ICT. The goal is to share an envisioned roadmap that can bring together key actors and enablers to further advance the domain of Edge AI.
- Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
- Europe > Netherlands > South Holland > Delft (0.04)
- Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.04)
- (4 more...)
- Information Technology > Security & Privacy (1.00)
- Energy (1.00)